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Abstract 

As a brief announcement [ 12 1, we proposed the following balls-into-bins strategy allocating n balls 
into n bins, denoted by [k, G?)-choice process. The placement process consists of n/k rounds, each of 
which k < d balls are placed into k least loaded bins out of d bins chosen independently and uniformly 
at random. In this paper, we present complete analysis for our preliminary results and provide tight upper 
and lower bounds for the maximum load that hold with high probability for any 1 < k < d < n. 

The entire spectrum of allocation schemes that lie between the boundaries of the single and multiple- 
choice algorithms is captured in this simple process. Our new results indicate that we can choose suitable 
parameters k and d to achieve the optimal tradeoff between the maximum bin load and message cost. For 
some k and d, the (k, <i)-choice process produces maximum load of O(l) whp at the expense of (1 + e)n 
messages for < e < 1 constant. This shows that our allocation scheme outperforms any known non- 
adaptive balls-into-bins process; any other algorithms achieving constant maximum load require higher 
message cost than what we have. For example, the standard multiple choice scheme requires 0(n log n) 
messages in order to achieve a constant maximum load. Furthermore, we present some applications in 
which (k, d)-choice is more efficient than the standard multiple-choice algorithm; i.e. the cost of item 
insertion and search is as low as (1 + o(l)) time on average. 







1 Introduction 



In the classical singe choice balls-into-bins problem, a ball is placed into a bin chosen independently and uni- 
formly at random (i.u.r.) The maximum load after n balls are placed into n bins is (l+o(l)) (log n/ log log n) 
with hight probability [14J. In the multiple choice paradigm, each ball is placed into the least loaded one 
out of d > 2 bins chosen i.u.r. Azar, Broder, Karlin, and Upfal in [2] showed that the maximum load is 
dramatically reduced to log log n/ log d + 0(1) in the (i-choice process where each ball is placed into the 
least loaded among d bins chosen i.u.r. Since then, a number of variants of the multiple choice processes 
have been proposed ffl |5j EJ [18] H |9j El E EJ [3j El and the goal in such algorithms is load balance. 
The maximum bin load is a commonly used measure that indicates the level of load balance. 

Many of the multiple-choice algorithms are sequential and non-adaptive, i.e. balls are placed sequen- 
tially into bins and each ball is associated with a fixed number of d possible bins. In non-adaptive schemes, 
the message cost is dn in total. However, in some applications where load balance is critical, one needs to 
increase the number of bin queries to d = f2(logn) to achieve 0(1) maximum load. There is a tradeoff 
between the maximum load and the message cost in non-adaptive schemes. In the aim of minimizing load 
balance as well as bin queries, Czumaj and Stemann [5] introduced adaptive sequential allocation algo- 
rithms that achieve log log n/ log d + O(l) maximum load and 0{d) bin queries per item, and (1 + o(l)) 
bin queries on average. Motivated by the case where the cost of probing bins is relatively high, Peres et al. 
lfT3l proposed (1 + /3) -choice algorithm; each time a ball is placed with probability (3 > a 2-choice scheme 
is used and with probability 1 — /3 a single choice scheme is used. They showed that the gap between the 
maximum and average bin load is ©(log re//3), regardless of the number of balls. Adler et al. [ 1 ] considered 
the standard multiple choice algorithm in parallel and distributed settings. They proposed non-adaptive par- 
allel algorithms that achieves 0(loglogn/logloglogn) maximum load and O (n log log nj log log log n) 
message cost in B(log log n/ log log log n) communication rounds. Recently, Lenzen and Wattenhofer [ 10] 
presented an adaptive parallel algorithm that achieves a bin load of two in log* n + O(l) communication 
rounds using O(n) messages in total. Refer to their paper for comprehensive survey on performance of 
various balls-into-bins algorithms. 

Previously, we considered the following questions [12]. If we place 2 balls at a time into 2 least 
loaded out of 3 bins chosen i.u.r, is the system still balanced in the sense that the maximum load is still 
0(loglogn)?. What if 5 into 8 bins or 99 balls into 100 bins? We generalized this idea and proposed the 
following allocation process, which we will call (k, cQ-choice process. For fixed k < d. if we place k < d 
balls at once into k least loaded bins among d possible destinations at a time, what is the maximum load of 
any bin after all n bins are placed into n bins? Without loss of generality, we assume that n is a multiple of 
k. The (k, cQ-choice process consists of n/k rounds, in each of which k < d balls are placed into k least 
loaded bins among d bins chosen i.u.r. . Bins are chosen with "sampling with replacement" policy Qand ties 
are broken arbitrarily. In this paper, we present a full analysis to our preliminary results presented in 2-page 
paper [12] and provide new and tight bounds on the maximum bin load that holds with high probability for 
any 1 < k < d < n. Many (unknown) allocation algorithms that lie between the boundary of the single and 
2-choice is captured in this simple process. Moreover, using the tight maximum load, we can find suitable 
values of k and d with which the (k, d) -choice process achieves the striking balance between the maximum 
load and message cost. For example, for k = c log n and d = (c + 1) log n with c > 1, the maximum load of 
(fc, d)-choice is at most log(c+ 1) + 0(1) whp and the total message cost is (1 + l/c)n with certainty. This 
indicates that a non-adaptive balls-into-bins algorithm can outperform any known balls-into-bins algorithms 

1 For example, in (2, 3)-choice, suppose that binl, binl, and bin2 have been chosen i.u.r. and binl is empty and bin2 has 1 ball. 
In this case, bin 1 will receive two balls. 
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including the ones using adaptive schemes, in the sense that the number of bin queries can grow unbounded 
to guarantee the constant maximum load. One drawback of our strategy in practice is that a search for an 



item seems to be costly since k balls are associated with d bins and d can be large. In Section 1.2 we discuss 
applications in which (k, (i)-choice maintains O(loglogra) maximum load and the average search cost is as 
low as (1 + o(l)) for any large d. 

The rest of the paper is organized as follows. In the following subsections, we summarize our main 
results and present applications in which the performance of (k, d) -choice is maximized. In Section [2j we 
discuss the model of the (k, d)-choice process and briefly summarize the main approach and ideas. Detailed 
analysis are provided in Section [3j Section [4] concludes the paper. 

1.1 Main Results 

Our main result can be formalized as follows. 

Theorem 1. Let 1 < k < d < n and d^ = j^r. Let M(k, d, n) denote the maximum load after n balls are 
placed into n bins using the (k, d)-choice process. The following holds with probability 1 — o(l). 

i) Ifd k = 0(1), then 

loE log Tl 

M( ^'" )£ i^HtT) +0<1) ' (1) 

ii) If dk — > 00 as n — > 00, then 

M(k, d,n) < (l + o(l))- — - — — + - — — — — — -. (2) 

log log d k log(d-k + l) 

Moreover, the bounds in ([7J and ([2]) are tight up to some additive constants. 

We discuss a number of interesting consequences derived from the results in Theorem [T] If k = 1, the 
maximum load coincides with the result of the standard d-choice process. At another extreme, if d = 0(n) 
and k rj d, the maximum load becomes (1 + o(l)) (log n/ log log n) from ([2]), the well known result from 
the single choice process. The true benefits of the (k, (i)-choice scheme lie somewhere in between theses 
two extreme ends. For example, k = clogn and d = k + logn for constant c > 1, the maximum load is 
log(c + 1) + 0(1) = 0(1) by (|2]). In this case, the total message cost is (1 + l/c)n, which outperforms 
any known balls-into-bins algorithms in the sense that (k, (i)-choice bounds the number of bin queries with 
certainty, while any algorithms with constant maximum load would incur higher message complexity than 
ours. 



1.2 Applications 

The (k, d) -choice allocation scheme can be naturally applied to storage systems based on consistent hashing 
|[T31[T2l . Suppose the following scenario. When a new file is created, it is replicated into k copies and each 
of the replicas is to be stored into different servers to increase file availability and fault tolerance. Or if the 
size of the file is large, to prevent load overhead of any server, it splits into k chunks and each chunk is stored 
into servers. When one needs to determine the location of those copies or chunks of a file, the (k, c?)-choice 
algorithm provides a simple and efficient solution to the question; choose d servers randomly and place them 
into k least loaded servers. The allocation process can be expedited by choosing d k = polylog(n) while 
maintaining O(loglogn) maximum load. To search a file later, it is sufficient to find one copy of the file if 
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the file has been replicated, or all the k chunks of the file need to be retr ieved if the file has been partitioned. 
The average search cost per item is d/k in the replication case, and if k = d — 1 and d is large, the search 
cost is (1 + o(l)) on average. In the case of data partition, searching all k chunks of a file takes 0(k + 1) by 
choosing d = k + 1. If the standard 2-choice was used, then the search cost would be 0{2k), almost twice 
more expensive than the searching in (k, d)-choice. 



2 Problem Model 

In this section, we discuss the model of the (k, d)-choice process and briefly summarize our main approach 
used in Section [3] 



2.1 Profoleof Bin Load 




Figure 1 : Normalized bins and bin load vectors 



Let b(r) = (&i(r), . . . , b n (r)) be a normalized load vector, where b x (r) is a random variable repre- 
senting the number of balls placed in the x's most loaded bin at the end of round r > 1. That is, at 
the end of round r, we order the bins from the most loaded to the least loaded (ties are broken arbitrar- 
ily) and we set fei(r) to be the load of the fullest bin, &2O") the load of the second fullest bin, and so on. 
Then the load vector b = (&i(r), . . . , b n (r)) fully characterizes the (k, <i)-choice process. For simplic- 
ity, we use b = (61, ... , b n ) to denote the load vector at the end of the final round n/k. Similarly, let 
b = (6i(r), . . . , b n (r)) denotes the bin load vector in the single choice scheme. 

Roughly speaking, the bin load is consistently decreasing as the bin number increases either in the d- 
choice scheme or the single choice scheme; the rate is double logarithmic in d-choice and logarithmic in 
the single choice. The main distinguishing feature of the (A;, d)-choice scheme is that the bin load does not 
decrease with consistent rate. More precisely, the profile of bin loads exhibit two different phases; first, 
the bin load decreases slowly as the bin number increases like d-choice, and after certain point is passed, 
it starts behaving like the single choice - decreasing fast. Interestingly enough, thephase shifting point is 
on the intersection between b and b, the single and (k, d)-choice bin (see Lemma pi), which is the crucial 
property used in our analysis. Let (x*, y*) be the intersection point. Abstract view of the normalized load 
vector is in Figure 1(a). 

Definition 1. For given load vectors h = (61, ... , b n ) and b = (61, ... , b n ), let x* satisfy the following 
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properties: 

Pr(6 x > s) < Pr(b x > s), if x < x* 
Pr(6 x > s) > Pr(6 x > s), if x > x* 

for s > 0. Let y* = Wfe call (x* , y*) f/ze intersection point ofh and b, denoted by b n b. 

To obtain the maximum load y = y* + y* in Figure 1(b), we compute and y* separately. In order 
to compute y* , we need to compute x* first. Note that y* represent the difference between b\ and 6^ . In 
Section |3j we compute tight bounds for y* and y*. 

2.2 Probability Vector 




Figure 2: Normalized bins and probability vectors 

Let p x denotes the probability that the x'th most loaded bin receives at least one ball in a round. The 
probability p x can be computed explicitly (see Lemma[T]). 

The probability vector p = (px, . . . ,p n ) determines the bin load vectors b(r) (and hence the (k,d)- 
choice process) as follows. Each round 1 < r < n/k, starting with b(r — 1) = (bx(r — 1), . . . , b n (r — 1)), 
performs the following two steps. 

1. k balls are placed into bins chosen according to p. 

2. After k balls are located, define b(r) = (&i(r), . . . , b n (r)) by sorting bins in decreasing order of bin 
load. 

Similarly, let p = (px, ■ ■ ■ ,p n ) denotes the probability vector in the single-choice scheme. In each round of 
the single-choice process, k balls are placed into k randomly chosen bins, which can be called (k, A;)-choice 
process. Observe that 

Pi < ■■■ <Pn 

PX = ...=p n = l-(l-l/n) k . 

and see Figure 2. 

Definition 2. For given probability vectors p = (px, . . . ,p n ) and p = (px, . . . ,p n ), let x* satisfy the 
following properties: 

Px — Px, if % — ^* 
Px > Px, otherwise. 

We call (x* , p Xt ) the intersection point ofp and p, denoted by p H p. 
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In Section 3.1 we show that the two x-coordinates of the intersection points of the bin load vectors and 
probability vectors coincide, and compute x* using p and p. 



3 Analysis 

In this section, we compute tight bounds of y* and y* . Recall that is the y-coordinate of the intersection 
point between the load vectors b = (61, ... , b n ) and b = (pi, ... , b n ), and y* is the gap between 61 and b Xit . 
Hereafter, bin x denotes the x'fh most load bin. In the following lemma, we compute p x , the probability 
that bin x receives at least one ball in a round. 

Lemma 1. For 1 < x < n, the probability that bin x receives at least one ball in a round is 

d \f/x\<i-t+s fx-r\ d - t+i \ , x\t-i 

»-£{ d - h+j )[ti -{—) JO") • < 3 > 

Proof. Fix 1 < x < n. We call bin 1 ,. . . , bin x heavy bins and bin x + 1, bin n light bins. First we 
compute the probability that some of the heavy bins receive at least one ball in a round, denoted by p< x . 
Later, p x can be easily obtained by slightly modifying p< x . 

Suppose that d bins are chosen i.u.r. in a round such that j bins are chosen from the light bins i.u.r. and 
d — j bins are chosen from heavy bins for < j < d. If j > k, then all k balls go to the sampled light bins 
and none of the heavy bins get a ball. If j < k, then j balls go to the j bins chosen from the light bins and 
hence k — j balls are placed into some of the heavy bins. Considering all the cases j < k, we have 

j=0 KJ/ 

bins chosen from the heavy bins do not contain bin x. This can 



For p x , we exclude the case that the d — j 

be done by replacing the factor J in ( 4 1 with ( ( ^ ) d ~ fc+,J — ( ) d_ k+3 \ , and then we have ( 3 1 . 



□ 



3.1 Computing .x* 

Computing the intersection point directly from the load vectors seems very difficult since fr^'s are random 
variables. Thus we use the probability vectors p = (pi, . . . ,p n ) and p = (pi, . . . ,p n ) instead. Computing 
intersection point between two known probability vectors is a lot easier than computing two load vectors of 
unknown random variables. Of course, we first prove that the x-coordinates of b n b and pflpdo coincide 
in the following Lemma. 

Lemma 2. Let x* be the x-coordinate of the intersection point between p and p. Then, for any s > 0, 

Pr(b x >s)> Pr(b x > s), ifx>x* (5) 
PrOx > s) < Pr(b x > s), if x < x*. (6) 

Proof. Let u = and u = t£tt = (1/n, . . . , 1/n), where ||v|| denotes for v = (v\,... ,v n ). 

Obviously, if x* is the intersection of p and p then it is the intersection between u and u. Without loss of 
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generality, we assume that u Xst = u Xt . Note that (k, d)-choice can be viewed as the following serialized 
allocation process: each ball is assigned to a bin sampled according to the distribution u. 

Let u m i n = min{u, u} = (u\, . . . , u Xt , u x *+i, • • • , u n ). Now we define two processes running simul- 
taneously as follows. In process 1, each time a ball is placed into a bin, it uses the distribution n Umm n with 

probability 1 1 u m j n 1 1 , and the distribution "Tj"" 1 '",, with probability 1 — 1 1 u m j n 1 1 . Each time a ball is placed 
into a bin in process 1 according to the distribution i| Um '"n , we place a new ball into the same bin number 
in process 2. However, if the ball in process 1 follows the distribution "Tj""" 1 ,, , we place a new ball into a 

-I- || u min|| 

bin using the distrbution j^rj—^Ti ■ It is easy to see that process 1 and 2 are equivalent to (k, d) -choice and 
single-choice processes. Note that in spite of bin sorting after each ball is inserted, the bin load vectors in 
process 1 and 2 satisfy the following property: b x > b x for x > x*, and b x i < b x > for x' < x*. We have 
shown that for each instance of the (k, <i)-choice process, we can define an instance of the single choice 
process for which 

b x < bx, for x < x*, 

b x > b x , for x > x* (7) 

hold. Similarly, we can prove that for each instance of the single-choice process, there is an instance of the 
(fc, (i)-choice process satisfying the property □ 

Now we compute the x-coordinate of intersection of p and p in the following Lemma. The proof is in 
Appendix. 

Lemma 3. For d^ > 2, the x-coordinate of the intersection between p and p is 

x* = e(— ). (8) 

dk 

3.2 Computing 

As defined in [2], the height of a ball is one more than the balls already in the bin at the time of insertion. We 
define two random variables in the single-choice scheme: fly for the number of balls with height at least y 
and v y for the number of bins with at least y balls. In the following Lemma, we present some useful bounds 
in the single-choice algorithm. Its proof is in Appendix. 

Lemma 4. 

Pr(^ % > — J <e 3 H! , 
„ / n \ 

In the next lemma, we compute , one of the two terms in the maximum load. 
Lemma 5. Let 1 < k < d < n. Then, with probability 1 — o(l), 

y ^ = 1(^0(1))^, ifd k ^oo, (9) 



O(l), ifd k = 0(1). 
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Proof. By the definition of we have = b Xt and u yt , = x*. By Lemma |4j with probability 

l-o(l), 



n on 
< v n . < 



6(y*)! " " (y*)! 
Using Lemma[2]and [3j y* satisfies the following equation: 



n n 



dk (y*V- 

for some constant C > 0. Thus y* satisfies yj = which yields ^J. □ 
3.3 Computing y* 

In this section, we show that the bin load decreases very slowly between bin 1 and bin sc*. Our analysis 
adopt the layered induction approach used in the standard multiple choice analysis in O. First, we define 
two random variables for the (k, <i)-choice analysis that correspond to ft y and v y used in the single-choice 
process. By (j, y , we denote the number of balls with height at least y after n balls are placed into bins in the 
(k, (i)-choice process. We use v y {r) to represent the number of bins that contains at least y balls at the end 
of r-th round. For simplicity, v y to denote u y (n/k). The following Lemma is used frequently in the rest of 
this paper. 

Lemma 6. Let X be the sum of binary random variables which are negatively associated. That is X = 
Xi such that Ep^X,-] < EfX^Ep^-]. 

i) 

E[X 2 } < E[X] + (E[X]) 2 (10) 

ii)IfE[X] < n, then 

Pr(X > 2/x) < e - ^ 3 . (11) 

Hi) IfE[X] > n, then 

Pr(X > -n) < e^^ 8 . (12) 

Proof. A proof of part i) is in lfl4ll . and part ii) and iii) can be proved by slightly modifying the basic 
Chernoff bounds HI1. □ 

Corollary 1. Lemma ^holds for X = v y , u y , and fly, for any y > 0. 

Proof. See Appendix. □ 

Now we are ready to compute y* . This section is devoted to prove the following Lemma. 
Lemma 7. For any 1 < k < d < n, 

lo&log(n/du) . , 

y* ^ r^ihru + 1 (13) 

log(a — k + 1) 

holds with probability 1 — o (-). 
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Proof. Recall that y* = (1 + o(l)) lo g^ g fc d , and i/^ < cj^, for some constant c > 0. In the following, we 
show that with high probability the sequence of u Vm , u yt +i, ■ ■ ■ decreases doubly exponentially until v y < 
12 log n. Let 1 < r < n/ k deno te the r'th round in the (k, d)-choice process. Let A y (r) = v y (r)— u y {r— 1). 

Then, v v = Er=i A y( r )- 

Fix y > y* . Let X r ^ y be a random variable that represents the number of balls with height at least y + 1 
at the end of r'th round. Then, for 1 < j < k, 

p r (^>,K)<( d j +i )er +j . a* 

The above inequality can be easily obtained from Lemma|IJ Let p y j = ( d _ d k+ ^) { L ^) d fc+J forj = l,...,k. 
We bound the conditional expectation of A y+ i(r) as follows: 

k 

E[Aj, + i(r) | u y ] < ^2jp y ,j 
i=i 

Since y > y*, we have 

(k- j)Uy Uyk Uyd k 

Pv^ 1 ~ n ( d - k + j + lf y ' j - n(d-k + 2) Py ' j ~ n Py ' J ~ CPy ' j ' 



Hence, 



k 

E[Aj,+i(r) | u v ] <ft,,i JV"^ 

3=1 

Py,i 



< 



Therefore, we have 



(1 - c) 2 k 

Using the fact that u y < \i y < p, y and Lemma |6j 



(1 - cf 



n/k 

nu y+1 \uy}=Y j my + i(r)\^} (15) 
< r j^r- (16) 



^ 2 (1 _ C )2 k 



(l-c) 2 \d-k + lj \ nJ k 



u y \d-k+i n 



(17) 



holds with probability 1 — o(l/n 2 ). We apply the recurrence relation 17 to each v y with y > y* and obtain 
that 

/ o ( d ^+(d-k+i)+...+{d-k+i)y-y*- 1 u^sv-v. 
. [ 2 ( d _ fc+ iJ ^ ( vy\{d-k+i)y y* 

yy - Ui-cY k IvJ n (18) 
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holds with probability 1 — o ( JL Ir 1 ). Now, we use the following inequality to ( 18 ) 

( d ) 

SddS±lL < (3d k ) d ~ k . (19) 
k 

which can be obtained using the Stirling's formula and the fact that (^) k /( d - k ) < 3. Then 



(d- fc+ i)M-;/*_i 
2 , , . rf A 3=5 / -v v \(d-k+l)v-v* 



Now we choose c = ^ so that (j—p^ fc^ 1 = (i_ c ) 2 C — 5- ^Therefore, 
Thus, if 

loglogn/4 

y - y* < 



log(d - fc + 1) ' 

then z/ y < 12 logn. It remains to prove that there are few more steps left to make v y < 1. The rest of the 
proof has been moved to Appendix. □ 

Finally, we show that the upper bound presented in Lemma|7]is essentially tight by providing a matching 
lower bound. 



Lemma 8. For any 1 < k < d < n, 



holds with probability 1 — o (^). 



log (a — k + 1) 



Proof. The proof we provide here is an extension of the analysis used in . For simplicity, let denote 
n/k. Let Ri = n&(l — 2~ l ). Divide the whole range of rounds [0, n/k] into subintervals as follows. 

[0,n k ]=U y * =0 [R y ,R y+1 ). 

Let Vi(Ri) denotes the number of bins with at least i balls at the end of the last round in Ri. We construct a 
sequence of satisfying vi(Ri) > whp. Define 

7o = n 

7i+1 2*+5 \d - k + l) [ n ] 

The lower bound of y* can be obtained by finding maximum i such that 7i > 8 log n and the rest of the 
proof has been moved to Appendix. □ 



2 If c > i, then u Vt +i < xj • Thus we can simply replace y* by y» + 1. That is, w.o.l.g. we can assume that c is an arbitrary 
constant > 0. 
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4 Conclusion and Future Work 



In this paper we studied the (k, <i)-choice process. We provided tight bounds on the maximum load and 
showed that, for suitably chosen k and d, the (k, d) -choice strategy achieves the optimal tradeoffs between 
the maximum load and message cost. 

There are many open questions remained about the (k, <i)-choice process. One of them is the maximum 
load for the case of m > n balls and n bins. Considering that fundamental differences between single 
and multiple choice processes become more pronounced as m grows, it is interesting to know how the 
(k, d) -choice will behave as m increases. 

Finally, let's consider the following simple modification of (k, (i)-choice. In each round, d bins are 
chosen randomly and k balls are placed into the bins in the most optimal way: less loaded candidate bins 
receive more balls. For example, in the (2, 3)-choice, assume that 3 random bins are chosen randomly, each 
contains 0, 2, 3 balls each. Then the two ball are placed into the empty bin, rather than going to an empty 
and the bin with load 2. We expect that this modification reduces the maximum load significantly, especially 
when the number of balls grows beyond the number of bins. 
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5 Appendix 

5. 1 Proof of Lemma |3] 

Proof. Even though the values of p x and p x are known explicitly, it is still hard to compute the solution x* 
that satisfies p x = p x . Instead of finding exact value of x* , we will find an upper and lower bound of x* 
and show that both are of 6 ( ^ j . To do so, we will find upper and lower bounds pi , p u for p x , and p\ , p u 
for p x , i.e., pi < p x < p u and p\ < p x < p u - Then the solution x that satisfies p\ = p u will be an upper 
bound of x*, and the solution x that satisfies p u = p~\ will be a lower bound of x*. 

First, we compute a lower bound of x* . Since p x = 1 — ( 1 — - ) k , we have pi : = ^ < p x < p u : = ^ . 
Applying the union bound to equation ([3]) , we obtain the upper bound of p x as 



In the second inequality above, we used the mean value theory. To find a lower bound of x*, we find x 
that equalizes the upper bound of p x and the lower bound of p x we have obtained. It remains to solve the 
following equation 





which is equivalent to 
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Using the fact that ( d d k ) = (c • dk) d k for some constant 1 < c < 3, the solution is x = © fgH ■ 

Next, we find an upper bound of x*. Computing an upper bound of is more involved. Recall that p<- 
is the probability that at least 1 ball is placed into bins < x. We proved that in Lemma [T] 



where 



k 

P<x = ^2p<x(j), 
3=1 



\d — k + j J \n, 

Let q< x (j) = ( d _ d k+ ) (^) d k+J ■ By the inclusion-exclusion principle fT6l . 



k 

P<* = Z)(-l) i+1 ?<xO')- (2D 

3=1 



From Lemma [T] 



3=1 

where 

d \ ( rx\d-k+j fx - 1\ d ~ fc+ A / 



Let = ^(j) - q< x -i(j) = { d _ d k+j ) {(^) d - k+ i - (^) d - fc+J ). Using the fact that p x (j) > 

P<x(j) -P<x-l{j) and ©' 

> P<z -P<x-1 

= £(-l) J ' +1 ?*tf). 

For any odd integer j < k, 

q x (j) -Q*(3 + 1 )=( d _ d k+ •) (/i(^) " " !)) 
where /,(*) = (l - 3=^*). By theM.V.T., 



mvi - f-d- - -n - d k + i ( x ' \d-k+j-i n _ k 1 A _ fc j Lr_^- fc +j 

/jW j " n { n> 1 d-Hi + ln J d - k + 7 + 1 n [ n > 

1 ,x'. 



d— k+j + ln d — k + j + 



n n 



((d-fc + j)-(fc-j)£) 
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for x' e[x — 1, x]. Let ^ = 

Suppose that 1 < k < d — 3. (The rest of cases in which k = l,d — 2, or d — 1 can be handled 
separately). Then 

-*<*-!)> 

q x (l)-q x (2) > k g(k) 
n 

> k g(d-3) 
n 

_k 1 (d\, 3 3 

fc 27 

> 

~ n24 

> . 

n 

Thus, we conclude that 

Px > - fe(2) > -• 
n 

Hence, 

n n 

x * ~ (k-l)/(d-k) ~ 2 4' 

□ 

5.2 Proof of Lemma U 

Proof. Fix y > 1. Define random variable H\(y) to represent the number of balls in bin 1 whose height at 
least y. That is, 

H 1 (y) = [y' -y + l] + 

where y' is the number of balls in bin 1 and [z] + = max{z, 0}. Let L\{y) be a binary random variable 
defined as L y = 1 if and only if bin 1 contains at least y balls. Then 

n 
n 
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For any y' > y, we have 

Pr(ffi(y) =y'-y+l) 



n 



1 



1 



n—y 



n 



y' + 1 J \n) 



> 



n 

y' + l 



y'+l 
y'+l 



iy T 7 i 



72 



77 



n-y'-l 



n-l)(y' + l) 



n — y 



n-y'-l 



(y' + i) 



Thus 



= ( 2 / , + l)Pr( J ff 1 (y)=y / -y + 2) 
>2Pr( J ff 1 (y) = y / -y + 2). 



E[fl" 1 (y)]= J^iPr(lTi(y)=i) 

<Pr(i? 1 (y) = l)^i(^ 
<4Pr(F 1 (y) = l). 



We can bound ¥i{H\{y) = 1) as 
1 



3y! 



< Pr(ffi(y) = 1) 



y/ V n / V n 



77 



y/ V n / 



< 



Combining ( 22 1 and ( 23 1 we obtain 



E[fi y ] < 



An 



Using E[Li(y)] > Pr(i?i(y) = 1) and <|23j we have 



n 

3y! 



(22) 

(23) 

(24) 

(25) 
□ 



Applying Lemma[6]to (24 ) and (25 1 yields the desired results. 
5.3 Proof of Corollary g 

Proof. Consider that n balls are placed into n bins by the single-choice process. Fix y > 0. Let and B x j, 
be binary random variables defined as 

L x = 1 if and only if load of bin x is at least y balls 

= 1 if and only if ball b goes into bin x and height of b is at least y 

It is easy to show that L x s are negatively associated, and hence Lemma[6]applies to u y = L x . Similarly 
we can show that Lemma [6] applies to v y . 

Since, for each b, Ylx=i * s either or 1, {B x y, x = 1, . . . , n} are negatively associated. Further- 
more, since for any b ^ b' , {B x ^} x are mutually independent, {B x ^} x ^ are negatively associated (See 0). 
We conclude that Lemma[6]holds for JX y = J2 X b Bx,b- D 
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5.4 Proof of Lemma 

Here we complete the proof of Lemma [7] Now we deal with the case u y < 12 log n. Let yi be the smallest 
y such that u y < 12 log n. We have shown that 

loglog(n/4) 
m ~ y *- d-k + l 

holds with probability 1 — o(l/n). In the following analysis, we assume that du <n/ log 3 n. |^Let y > y2- 
By (TT6]) and ([19]), 

d » ' v y2\d-k+i n 



E[^ 2+1 |^]<l.l( d _^ + i j 



d — A; + 1 y n A: 
1 1.1 U^V^^n 



n / n 



< 1.1 

n 

<4^K 2 ) 2 = o(l). 



By the Markov inequality, we have fy 2 +i < 1 whp and hence we let 

i -, . loglogtn/dfc) 

y = 2/2 + i - y* < — i — , , : + l. 

a — k + 1 

5.5 Proof of Lemma |8] 

Proof. Let r G Define binary random variable Z r as 

{exactly one out of k balls at round r has height i + 1 or 
i/ i+ i(r - 1) > 7i+i. 



Z r = 1 if and only if 

Then 



Pr(Z r = 1 | Vi {r - 1), Vi+i(r - 1)) 

„ , d \ ( f vi{r - l) \ d - h+1 ( v i+1 (r - l) \ d ~ k+1 \ / ^(r-1)^ 



d — fc + ly I \ n y \ n J J \ n 

Assuming that vi{Ri) > ji, we prove that Vi + i{Ri + \) > holds whp. We need to prove that if 
v i+1 (r - 1) > Ji+i for some r £ then i/ i+1 (.R i+1 ) > 7 i+1 . Suppose that z/ i+ i(r - 1) < Ji+i. 

Using 7j + i < 7j/2 < i/j(r — l)/2, we have 



Pr(Z r = 1) > 1 



d \ A Wj(r — 1)\ X / /fj(r-l)\ d fe+1 /7 i+ i 



d — A; + 1 / V n 7 \ \ re / ^ n 

> U d W ^( r -l)^-V^(r-l)^- fc+1 



d-fc+l 



2 yd — fe + ly \ n y \ n 

_ 2 Id - fc + 1/ V n) n' 



3 If dft > n/log 3 n, the contribution of y* to the maximum load becomes negligible. In this case, the maximum load is 
determined by y, . 
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If T« < fcTT'then 



2 3 \d-k + l, 

and 



By Lemma|6] 



Pr( ^ Z r < 7i+1 ) < e"^ 1 / 4 . 
re[Ri,Ri+i) 

log log n/d fc 



Let 12 be the largest i such that 7« > 8 log n to obtain y* > 12 = d-k+i k ~ ^(-0- ^ 
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